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Abstract. We study the computational complexity of the FO model 
checking problem on interval graphs, i.e., intersection graphs of intervals 
on the real line. The main positive result is that this problem can be 
solved in time 0(n log n) for n- vertex interval graphs with representa- 
tions containing only intervals with lengths from a prescribed finite set. 
We complement this result by showing that the same is not true if the 
lengths are restricted to any set that is dense in some open subset, e.g., 
in the set (1, 1 + e). 

Keywords: FO model checking; parameterized complexity; interval graph; clique- 
width 

1 Introduction 

Results on the existence of an efficient algorithm for a class of problems have re- 
cently attracted a significant amount of attention. Such results are now referred 
to as algorithmic meta-theorems, see a recent survey |14j . The most prominent 
example is a theorem of Courcelle p] asserting that every MSO property can be 
model checked in linear time on the class of graphs with bounded tree-width. 
Another example is a theorem of Courcelle, Makowski and Rotics [5] assert- 
ing that the same conclusion holds for graphs with bounded clique-width when 
quantification is restricted to vertices and their subsets. 

In this paper, we focus on more restricted graph properties, specifically those 
expressible in the first order logic. Clearly, every such property can be tested in 
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polynomial time if we allow the degree of the polynomial to depend on the 
property of interest. But can these properties be tested in so-called fixed param- 
eter tractable (FPT [B]) time, i.e., in polynomial time where the degree of the 
polynomial does not depend on the considered property? The first result in this 
direction could be that of Seese [5D] : Every FO property can be tested in linear 
time on graphs with bounded maximum degree. A breakthrough result of Frick 
and Grohe [11] asserts that every FO property can be tested in almost linear 
time on classes of graphs with locally bounded tree- width. Here, an almost linear 
algorithm stands for an algorithm running in time 0(n 1+e ) for every e > 0. A 
generalization to graph classes locally excluding a minor (with worse running 
time) was later obtained by Dawar, Grohe and Kreutzer [4]. 

Research in this direction so far culminated in establishing that every FO 
property can be tested in almost linear time on classes of graphs with locally 
bounded expansion, as shown (independently) by Dawar and Kreutzer [5] (also 
see [13] for the complete proof), and by Dvorak, Krai' and Thomas [7]. The 
concept of graph classes with bounded expansion has recently been introduced 
by Nesetfil and Ossona de Mendez [16117118] ; examples of such graph classes 
include classes of graphs with bounded maximum degree or proper minor-closed 
classes of graphs. A holy grail of this area is establishing the fixed parameter 
tractability of testing FO properties on nowhere-dense classes of graphs. 

In this work, we investigate whether structural properties which do not yield 
(locally) bounded width parameters could lead to similar results. Specifically, we 
study the intersection graphs of intervals on the real line, which are also called 
interval graphs. When we restrict to unit interval graphs, i.e., intersection graphs 
of intervals with unit lengths, one can easily deduce the existence of a linear time 
algorithm for testing FO properties from Gaifman's theorem, using the result of 
Courcelle et al [2] and that of Lozin [T5] asserting that every proper hereditary 
subclass of unit interval graphs, in particular, the class of unit interval graphs 
with bounded radius, has bounded clique-width. This observation is a starting 
point for our research presented in this paper. 

Let us now give a definition. For a set L of reals, an interval graph is called 
an L-interval graph if it is an intersection graph of intervals with lengths from 
L. For example, unit interval graphs are {l}-interval graphs. If L is a finite set 
of rationals, then any L-interval graph with bounded radius has bounded clique- 
width (see Section[4]for further details). So, FO properties of such graphs can be 
tested in the fixed parameter way. However, if L is not a set of rationals, there 
exist L-interval graphs with bounded radius and unbounded clique-width, and 
so the easy argument above does not apply. 

Our main algorithmic result says that every FO property can be tested in 
time 0(n log n) for L-intcrval graphs when L is any finite set of reals. To prove 
this result, we employ a well-known relation of FO properties to Ehrenfeucht- 
Frai'sse games. Specifically, we show using the notion of game trees, which we 
introduce, that there exists an algorithm transforming an input L-interval graph 
to another L-interval graph that has bounded maximum degree and that satisfies 
the same properties expressible by FO sentences with bounded quantifier rank. 
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On the negative side, we show that if L is an infinite set that is dense in 
some open set, then //-interval graphs can be used to model arbitrary graphs. 
Specifically, we show that L-interval graphs for such sets L allow polynomially 
bounded FO interpretations of all graphs. Consequently, testing FO properties 
for L-intervals graphs for such sets L is W[2]-hard (see Corollary 5.2). In ad- 
dition, we show that unit interval graphs allow polynomially bounded MSO 
interpretations of all graphs. 

The property of being W[2]-hard comes from the theory of parameterized 
complexity [BJ, and it is equivalent to saying that the considered problem is 
at least as hard as the d- dominating set problem, asking for an existence of a 
dominating set of fixed parameter size d in a graph. It is known that, unless the 
Exponential time hypothesis fails, W[2]-hard problems cannot have polynomial 
algorithms with the degree a constant independent of the parameter (of the 
considered FO property in our case). 

In Section [2] we introduce the notation and the computational model used 
in the paper. In the following section, we present an O(nlogn) algorithm for 
deciding FO properties of L-interval graphs for finite sets L. In Section |1J we 
present proofs of the facts mentioned above on the clique-width of L-interval 
graphs with bounded radius. Finally, we establish FO interpretability of graphs 
in L-interval graphs for sets L which are dense in an open set in Section [5j 



2 Notation 

An interval graph is a graph G such that every vertex v of G can be associated 
with an interval J(v) = [£(v),r(v)) such that two vertices v and v' of G are 
adjacent if and only if J(v) and J(v') intersect (it can be easily shown that the 
considered class of graphs remains the same regardless of whether we consider 
open, half-open or closed intervals in the definition). We refer to such an assign- 
ment of intervals to the vertices of G as a representation of G. The point £(v) is 
the left end point of the interval J(v ) and r(v) is its right end point. 

If L is a set of reals and r(v) — £(v) £ L for every vertex v, we say that G 
is an L-interval graph and we say that the representation is an L -representation 
of G. For example, if L = {1}, we speak about unit interval graphs. Finally, if 
r(v) — £(v) € L and < £(v) < r(v) < d for some real d, i.e., all intervals are 
subintervals of [0, d), we speak about (L, d)-interval graphs. Note that if G is an 
interval graph of radius k, then G is also an (L, (2k + 1) max L) -interval graph 
(we use maxL and minL to denote the maximum and the minimum elements, 
respectively, of the set L). 

We now introduce two technical definitions related to manipulating intervals 
and their lengths. These definitions are needed in the next section. If L is a set 
of reals, then L( fe ) is the set of all integer linear combinations of numbers from 
L with the sum of the absolute values of their coefficients bounded by k. For 
instance, = {0} and LW = L U {0}. An L-distance of two intervals [a, b) 
and [c, d) is the smallest k such that c — a G L^ k \ If no such k exists, then the 
L-distance of two intervals is defined to be oo. 
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Since we do not restrict our attention to L-interval graphs where L is a 
set of rationals, we should specify the computational model considered. We use 
the standard RAM model with infinite arithmetic precision and unit cost of all 
arithmetic operation, but we refrain from trying to exploit the power of this com- 
putational model by encoding any data in the numbers we store. In particular, 
we only store the end points of the intervals in the considered representations of 
graphs in numerical variables with infinite precision. 

2.1 Clique- width 

We now briefly present the notion of clique- width introduced in [5] . Our results 
on interval graphs related to this notion are given in Section [4j A fc-labeled 
graph is a graph with vertices that are assigned integers (called labels) from 1 
to k. The clique-width of a graph G equals the minimum k such that G can 
be obtained from single vertex graphs with label 1 using the following four 
operations: relabeling all vertices with label i to j, adding all edges between the 
vertices with label i and the vertices with label j (i and j can be the same), 
creating a vertex labeled with 1, and taking a disjoint union of graphs obtained 
using these operations. 

2.2 First Order Properties 

In this subsection, we introduce concepts from logic and model theory which we 
use. A first order (FO) sentence is a formula with no free variables with the usual 
logical connectives and quantification allowed only over variables for elements 
(vertices in the case of graphs). A monadic second order (MSO) sentence is a 
formula with no free variables with the usual logical connectives where, unlike 
in FO sentences, quantification over subsets of elements is allowed. An FO prop- 
erty is a property expressible by an FO sentence; similarly, an MSO property 
is a property expressible by an MSO sentence. Finally, the quantifier rank of a 
formula is the maximum number of nested quantifiers in it. 

FO sentences are closely related to the so-called Ehrenfeucht-Fraisse games. 
The d-round Ehrenfeucht-Fraisse game is played on two relational structures R\ 
and i?2 by two players referred to as the spoiler and the duplicator. In each of 
the d rounds, the spoiler chooses an element in one of the structures and the 
duplicator chooses an element in the other. Let Xi be the element of R\ chosen 
in the z-th round and yi be the element of Ri. We say that the duplicator wins 
the game if there is a strategy for the duplicator such that the substructure of 
Ri induced by the elements x±, . . . , Xd is always isomorphic to the substructure 
of i?2 induced by the elements y%, . . . , yd, with the isomorphism mapping each 
Xi to j/i. 

The following theorem |8|10j relates this notion to FO sentences of quantifier 
rank at most d. 

Theorem 2.1. Let d be an integer. The following statements are equivalent for 
any two structures R and R' : 
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— The structures R and R! satisfy the same FO sentences of quantifier rank at 
most d. 

— The duplicator wins the d-round Ehrenfeucht-Fraisse game for R and R' . 

We describe possible courses of the d-round Ehrenfeucht-Fraisse game on a 
single relational structure by a rooted tree, which we call a d-EF-tree. All the 
leaves of an d-EF-tree are at depth d and each of them is associated with a 
relational structure with elements labeled from 1 to d. The d-EF-tree T for the 
game played on a relational structure R is obtained as follows. The number of 
children of every internal node of T is the number of elements of R and the 
edges leaving the internal node to its children are associated in the one-to-one 
way with the elements of R. So, every path from the root of T to a leaf u of 
T yields a sequence x\ , . . . , Xd of the elements of R (those associated with the 
edges of that path) and the substructure of R induced by x\, . . . ,Xd with Xj 
labeled with i is the one associated with the leaf u. 

A mapping / from a d-EF-tree T to another d-EF-tree T' is an EF-homomor- 
phism if the following three conditions hold: 

1. if u is a parent of v in T, then f{u) is a parent of f(v) in T\ and 

2. if u is a leaf of T, then f(u) is a leaf of T', and 

3. the structures associated with u and f(u) are isomorphic through the bijec- 
tion given by the labelings. 

Two trees T and T' are EF-equivalent if there exist an EF-homomorphism from 
T to T' and an EF-homomorphism from T 1 to T. 

Let T and T' be the d-EF-trees for the game played on relational structures 
R and R' , respectively. Suppose that T and T' are EF-equivalent and let /i and 
fi be the EF-homomorphism from T to T' and T' to T, respectively. We claim 
that the duplicator wins the d-round Ehrenfeucht-Fraisse game for R and R' . 
Let us describe a possible strategy for the duplicator. We restrict to the case 
when the spoiler chooses an element Xi of R in the i-th step, assuming elements 
Xi, . . . , Xi-i in R and elements yx, . . . , yi~\ in R' have been chosen in the previous 
rounds. Let Uq, . . . ,Ui be the path in T corresponding to x%, . . . , x^. The dupli- 
cator chooses the element yi in R' that corresponds to the edge 
in T' ■ It can be verified that if the duplicator follows this strategy, then the 
substructures of R and R' induced by x\, . . . , Xd and y±, . . . , yd, respectively, are 
isomorphic. 

Let us summarize our findings from the previous paragraph. 

Theorem 2.2. Let d be an integer. If the d-EF-trees for the game played on 
two relational structure R and R' are EF-equivalent, then the duplicator wins 
the d-round Ehrenfeucht-Fraisse game for R and R' . 

The converse implication, i.e., that if the duplicator can win the d-round 
Ehrenfeucht-Fraisse game for R and R' , then the d-EF-trees for the game played 
on relational structures R and R' are EF-equivalent, is also true, but we omit 
further details as we only need the implication given in Theorem 2.2 in our 
considerations. 
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We finish this section with some observations on minimal d-EF-trees in EF- 
equivalence classes. Let T be a d-EF-tree. Suppose that an internal node at 
level d — 1 has two children, which are leaves, associated with the same labeled 
structure. Observe that deleting one of them yields an EF-equivalent d-EF-tree. 
Suppose that we have deleted all such leaves and an internal node at level d — 2 
has two children with their subtrees isomorphic (in the usual sense). Again, 
deleting one of them (together with its subtree) yields an EF-equivalent d-EF- 
tree. So, if T' is a minimal subtree of T that is EF-equivalent to T and K is 
the number of non-isomorphic d-labeled structures, then the degree of nodes at 
depth d — 1 does not exceed K, those at depth d — 2 does not exceed 2 K , those 
at depth d — 3 does not exceed 2 2 , etc. We conclude that the size of a minimal 
subtree of T that is EF-equivalent to T is bounded by a function of d and the 
type of relational structures considered only. 

3 FO Model Checking 

Using Theorems |2.1| and |2.2| we prove the following kernelization result for L- 
interval graphs. 

Theorem 3.1. For every finite subset L of reals and every d, there exists an 
integer Kq and an algorithm A with the following properties. The input of A is 
an L -representation of an n-vertex L-interval graph G and A outputs in time 
O(nlogn) an L -representation of an induced subgraph G of G such that 

— every unit interval contains at most Kq left end points of the intervals cor- 
responding to vertices of G' , and 

— G and G' satisfy the same FO sentences with quantifier rank at most d. 

Proof. We first focus on proving the existence of the number Kq and the sub- 
graph G' and we postpone the algorithmic considerations to the end of the proof. 

As the first step, we show that we can assume that all the left end points 
are distinct. Choose S to be the minimum distance between distinct end points 
of intervals in the representation. Suppose that the intervals are sorted by their 
left end points (resolving ties arbitrarily). Shifting the i-th interval by iS/2n, for 
i = 1, . . . , n, to the right does not change the graph represented by the intervals 
and all the end points become distinct. 

Choose e to be the minimum positive element of L 1 -- 2 + \ Fix any real a and 
let X be the set of all intervals [x, x + e) such that x — a € L^ 2 '. By the choice 
of e, the intervals of I are disjoint. In addition, the set I is finite (since L is 
finite). Let W be the set of vertices w of G such that £(w) lies in an interval 
from I, and for such a vertex w, let i(w) be the left end point of that interval 
from I. Define a linear order on W such that w < w' for w and w' from W iff 
£(w) — i(w) < £(w') — i(w') and resolve the cases of equality for distinct vertices 
w and w' arbitrarily. 
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We view W as a linearly ordered set with elements associated with intervals 
from 1 (specifically the interval with the left end point at i(w)) as well as asso- 
ciated with the lengths of their corresponding intervals in the representation of 
G. Let us establish the following claim. 

Claim. There exists a number K depending on only |X| and d such that if W 
contains more than K elements associated with the interval [a, a + e), then there 
exists an element w € W associated with [a, a + e) such that the d-EF-trees for 
the game played on W and W \ {w} are EF-equivalent. 

Indeed, let T be the d-EF-tree for the game played on W and let T' a 
minimal subtree of T that is EF-equivalent to T. Recall that the size of T' does 
not exceed a number K depending on only \X\ and d. If W contains more than 
K elements associated with [a, a + e), then one of them is not associated with 
edges that are present in T' ■ We set w to be this element. This finishes the proof 
of the claim. 

Since the d-EF-trees for the game played on W and W \ {w} are EF- 
equivalent, the duplicator wins the d-round Ehrcnfcucht-Fraisse game for W 



and W \ {w} by Theorems 2.1 and 2.2 



We now describe a strategy for the duplicator to win the d-round Ehrenfeucht- 
Frai'sse game for the graphs G and G\w. During the game, some intervals from 
X will be marked as altered. At the beginning, the only altered interval is the 
interval [a,a + e). 

The duplicator strategy in the i-th round of the game is the following. 

— If the spoiler chooses a vertex w' with £(w') in an interval of X at L-distancc 
at most 2 d+1 ~ l from an altered interval, then the duplicator follows its win- 
ning strategy for the d-round Ehrenfeucht-Frai'sse game for W and W \ {w} 
which gives the vertex to choose in the other graph. In addition, the dupli- 
cator marks the interval of I that contains £(w') as altered. Note that the 
interval of the chosen vertex has its left end point in the same interval of I 
as w' . 

— Otherwise, the duplicator chooses the same vertex in the other graph. No 
new intervals are marked as altered. 

It remains to argue that the subgraphs of G and G \ w obtained in this way 
are isomorphic. Let w\, . . . , Wd be the chosen vertices of G and w[, . . . ,w' d the 
chosen vertices of G' . For brevity, let us refer to vertices corresponding to the 
intervals with left end points in the altered intervals as to altered vertices. If Wi 
is not altered, then Wi — w[. If Wi is altered, then £(wi) and ^(w-) are in the 
same interval J € I and the only intervals that might intersect the intervals 
corresponding to Wi and w\ differently are those with left end points in the 
intervals of I at L-distance at most two from J. However, if some chosen vertices 
have their left end points in such intervals, then these intervals must also be 
altered and these chosen vertices are altered. Since we have followed a winning 
strategy for the duplicator for W and W \ {w} when choosing altered vertices, 
the subgraphs of G and G\w induced by the altered vertices are isomorphic. We 
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conclude that the subgraphs of G and G\w induced by the vertices w±, . . . , Wd 
and w[, . . . ,w' d , respectively, are isomorphic. So, the duplicator wins the game. 

Let us summarize our findings. If an interval of length s contains more than 
K left end points of intervals in the given L-representation of G, then one of 
the vertices corresponding to these intervals can be removed from G without 
changing the set of FO sentences with rank at most d that are satisfied by G. 
So, the statement of the theorem is true with Kq set to Xfe -1 ]. 

It remains to consider the algorithmic aspects of the theorem. The values of 
£ and Kq are determined by L and d. The algorithm sorts the left end points 
of all the intervals (this requires O(nlogn) time) and for each of these points 
computes the distance to the left end of the interval that is K positions later 
in the obtained order. If all these distances are at least e, then every interval of 
length at most e contains at most Kq left end points of the intervals and the 
representation is of the desired form. 

Otherwise, we choose the smallest of these distances and consider the corre- 
sponding interval [a, &), b — a < e, containing at least Kq left end points of the 
intervals from the representation. By the choice of this interval, any interval of 
length b — a at L-distance at most 2 d+1 from [a, b] contains at most Kq + 1 left 
end points of the intervals from the representation. So, the size of the d-EF-tree 
for the game played on the vertices v with £(v) in such intervals is bounded by a 
function of Kq, d and \L\. Since this quantity is independent of the input graph, 
we can identify in constant time a vertex w with t(w) £ [a, b) whose removal 
from G does not change the set of FO sentences with quantifier rank d satisfied 
by G. 

We then update the order of the left end points and the at most Kq computed 
distances affected by removing w, and iterate the whole process. Since at each 
step we alter at most Kq distances, using a heap to store the computed distances 
and choose the smallest of them requires O(logn) time per vertex removal. So, 
the running time of the algorithm is bounded by 0(n log n). □ 

It is possible to think of several strategies to efficiently decide FO properties 
of L-interval graphs given Theorem |3.1[ We present one of them. Fix an FO 
sentence with quantifier rank d and apply the algorithm from Theorem |3.1| to 
get an L-interval graph and a representation of this graph such that every unit 
interval contains at most K left end points of the intervals of the representation. 
After this preprocessing step, every vertex of the new graph has at most + 
l)[maxL] neighbors. In particular, the maximum degree of the new graph is 
bounded. The result of Seese [5D] asserts that every FO property can be decided 
in linear time for graphs with bounded maximum degree, and so we conclude: 

Theorem 3.2. For every finite subset L of reals and every FO sentence <P, there 
exists an algorithm running in time O(nlogn) that decides whether an input n- 
vertex L-interval graph G given by its L-representation satisfies <S>. 
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4 Clique-width of Interval Graphs 



Unit interval graphs can have unbounded clique- width [12] , but Lozin [15] noted 
that every proper hereditary subclass of unit interval graphs has bounded clique- 
width. In particular, the class of ({1}, d)-interval graphs has bounded clique- 
width for every d > 0. Using Gaifman's theorem, it follows that testing FO 
properties of unit interval graphs can be performed in linear time if the input 
graph is given by its {l}-representation with the left end points of the intervals 
sorted. We provide an easy extension of this, and outline how it can be used to 
prove the special case of our main result for FO model checking when L is a 
finite set of rational numbers 

Lemma 4.1. Let L be a finite set of positive rational numbers. For any d > 0, 
the class of (L,d) -interval graphs has bounded clique-width. 

Proof. Let a be the largest rational number such that every element of L is an 
integer multiple of a. Without loss of generality, we can assume that d is not a 
multiple of a (otherwise, we slightly increase d). We show that the clique- width 
of any (L, d)-interval graph is at most K := \d/a] + 1. 

Let G be an (L, d)-interval graph with vertices v\, . . . , v n . Consider an (L, d)- 
representation of G. Let bi be the smallest non- negative real such that £{vi) — bi 
is a multiple of a. By a slight perturbation, we may assume that all the numbers 
bi are distinct. Without loss of generality, we can also assume that < &i < 
■••<&„< a. 

We will now proceed in several steps. After the i-th step, we will have con- 
structed the subgraph of G induced by the vertices vt,...,Vi such that the label 
of the vertex Uj is \i(vi)/a\. In the first step, we insert the vertex V\ with la- 
bel \£(vi)/a~\. In the i-th step, we insert the vertex i>i with label K, join it by 
edges to all vertices with labels between \t(vi)/a\ and \r(vi)/a], and relabel it 
to \£(vi)/a]. By the choice of a and the assumption that b\ < ■ ■ ■ < b n , the 
vertex Vi is adjacent to its neighbors among v\, . . . , i>j_i. □ 

From Lemma |4.1| and Gaifman's theorem, one can approach the FO model 
checking problem on L-interval graphs with L containing rational numbers only 
as follows. L-interval graphs with radius d are (L, (2d+l) maxL)-interval graphs. 
By Gaifman's theorem, every FO model checking instance can be reduced to 
model checking of basic local FO sentences, i.e., to FO model checking on L- 
interval graphs with bounded radius. Since such graphs have bounded clique- 
width, the latter can be solved in linear time by [lj. Combining this with the 
covering technique from |11] , which can be adapted to run in linear time in the 
case of L-interval graphs, we obtain the following. 

Corollary 4.2. Let L be a finite set of positive rational numbers. The FO model 
checking problem can be solved in linear time on the class of L-interval graphs 
if the input graph is given by its L- representation with the left end points of the 
intervals sorted. 
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However, Corollary |4.2| is just a fortunate special case, since aside of rational 
lengths one can prove the following. 

Lemma 4.3. For any irrational q > there is d such that the class of ({1, q}, a) - 
interval graphs has unbounded clique-width. 

Proof. This proof is in a sense complementary to that of Lemma |4.1| We may 
assume q > 1 (otherwise, we rescale and consider the set {1, 1/q}). So, fix L = 
{l,q}, d = q + 3 and an integer n (to be specified later). 

Our task is to construct a ({1, q}, <i)-interval representation of a graph G 
with large clique- width. Since q is irrational, for every I we can find n such that 
iW n [0, d — q) contains more than £ points. We actually construct an arbitrarily 
long sequence P — (a\, <22, . . . , a n ) of such points as follows: a\ = 0, 02 = 1, and 
for i > 2 set 

— a,i = Oj— 1 + 1) provided that |ai_2 — ft<— 1| = 1 and a;_i < d — 2, 

— a, = asj_i — 1, provided that a^2 — = 9, and 

— dj = a,_i — (7 otherwise (we call this cij a q-element of P). 

Informally, we are "folding" a long sequence with differences from into a 
bounded length interval, avoiding as much collisions of points as possible. 

Let S > be such that nS is smaller than the smallest number in ZA n ) n 
(0, d — q). Let us introduce the following shorthand notation: if J is an interval 
and r a real, then J + r is the interval J shifted by r to the right. Similarly, if 
1 is a set of intervals, then 1 + r is the set of the intervals from I shifted by r 
to the right. We define sets of intervals U\ := {[iS, 1 + iS) : i = 0, . . . , n — 1} and 
U q := {[iS, q + i5) : i = 0, . . . , n — 1}. For further reference we say that intervals 
[iS, 1 + iS) or [id, q + iS) are at level i. 

For i = 1, . . . , n, we set Wj = U q +ai if is a g-element of P, and Wj = Ui+a { 
otherwise. Then every interval of W, is a subinterval of [0, d). Let G be a graph 
on n 2 vertices represented by the union of the interval sets Wi U W2 U • • • U W„. 
Let Wi, i = l,...,n, be the vertices represented by Wi- We claim that the 
clique-width of G exceeds any fixed number k £ N when n sufficiently large. 

Assume, for a contradiction, that the clique-width of G is at most k. We can 
view the construction of G as a binary tree and conclude a fc-labeled subgraph 
d of G with |n 2 < |V(Gi)| < |?i 2 appeared during the construction of G. 
However, this implies that vertices of Gi have at most k different neighborhoods 
in G \ V{G\). We will show that this is not possible (assuming that n is large). 

For 2 < i < n, vertices x £ Wi-\ and y £ Wi are mates if they are represented 
by copies of the same-level intervals from U\ or U q above. Our first observation is 
that, up to symmetry between i-1 and i, < |Wi_inV(G ! i)|-|W r inV - (G ! i)| < k. 
Suppose not. Then there exist k + 1 vertices in Wi-i n V{G\) whose mates are 
in Wi \ V(G\), and thus certify pairwise distinct neighborhoods of the former 
ones in G\V{G X ). 

A set Wi is crossing G\ if 7^ Wi n V{G\) 7^ Wi. The arguments given in 
the previous paragraph and |n 2 < |V(Gi)| < |n 2 imply that for any m, if n is 
large, there exist sets Wi ,Wi 0+ i, . . . , Wi 0+m in G all crossing G\. So, we can 
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select an arbitrarily large index set / C {i , . . . , i$ + m — 1}, |/| = £, such that 
for each i € J the element dj+i is to the right of a,-, and that all intervals in 
(J,. £/ VVj share a common point. In particular, a.; is not a g-element and so both 
Wj and Wi+i arc shifted copies oili\. Let ii, . . . , be the elements of X ordered 
according to the (strictly) increasing values of <Zj, i.e., < • • • < a^. 

Finally, for any j,f £ {1, . . . ,£} such that f > j + 1, we see that each vertex 
of W^. fW(Gi) cannot have the same neighborhood as any vertex of Wi , C\V(Gi): 
this is witnessed by the non-empty set W ij+1+ \ \ V(G{) (represented to the right 
of the intervals from Wy while intersecting every interval from Wj , ). Therefore, 
the vertices of G\ have at least 1/2 > k distinct neighborhoods in G \ V(Gi), 
which contradicts the fact that the clique-width of G is at most k. □ 

5 Graph Interpretation in Interval Graphs 

A useful tool when solving the model checking problem on a class of structures 
is the ability to "efficiently translate" an instance of the problem to a different 
class of structures, for which we already may have an efficient model checking 
algorithm. To this end we introduce simple FO graph interpretation, which is an 
instance of the general concept of interpretability of logic theories [TU] restricted 
to simple graphs with vertices represented by singletons. 

An FO graph interpretation is a pair 1 = (v, /i) of FO formulae (with 1 and 
2 free variables respectively) where [i is symmetric, i.e., G \= /i(x,y) fj,(y,x) 
in every graph G. If G is a graph, then 1(G) is the graph defined as follows: 

— The vertex set of 1(G) is the set of all vertices v of G such that G |= ^(f), 
and 

— the edge set of 1(G) is the set of all the pairs {u, v} of vertices of G such 
that G \= v(u) A u(v) A /i(u, v). 

We say that a class C\ of graphs has an FO interpretation in a class C2 if there 
exists an FO graph interpretation 1 such that every graph from C\ is isomorphic 
to 1(G) for some G E C 2 . 

Lemma 5.1. If L is a subset of non-negative reals that is dense in some non- 
empty open set, then there exists a polynomially bounded simple FO interpreta- 
tion of the class of all graphs in the class of L-interval graphs. 

Proof. By scaling, we can assume that L is dense in [1, 1 + e) for some e > 0. 
To simplify our considerations, we also assume that L = [1,1 + e]; it can be 
routinely verified that the lengths of intervals appearing in the representation of 
the graph H we construct can be slightly perturbed, preserving the structure of 
the graph. 

Let G be a simple graph with n > 2 vertices (the case n = 1 can be handled 
separately) and let vi,...,v n be its vertices. Further, let S = We are going 
to construct an L-interval graph H with 3n+5+\E(G)\ vertices, and FO formulas 
[i and v such that (/1, v) is a simple FO interpretation of G in H. The formulas 
[i and v will be independent of G and H. 
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The vertex set of H will be formed by sets V\, V% and V3 (each of them 
containing n + 1 vertices), a set W (containing |-E(G)| vertices) and two special 
vertices a and b. Specifically, the set Vi, i — 1,2,3, contains vertices fcy, j = 
0, . . . , n, and the set W contains vertices ej t j> for all pairs 1 < j < j' < n such 
that VjVf € E(G). 

We now describe an L-representation of H. To simplify our description, we 
use closed intervals but an appropriate perturbation can translate the represen- 
tation to the model of half-open intervals considered elsewhere in the paper. 

— For i — 1, 2, 3 and j = 0,1, ... ,n, the interval corresponding to t i j is the 
unit interval [i — 1 + (i + j)d, i + (i + j)S]. 

— The interval corresponding to a is the unit interval [0, 1]. 

— The interval corresponding to b is the unit interval [(n + 2)5, 1 + (n + 2)5]. 

— The interval corresponding to a vertex ej k is the interval [1 + (1 + j)6, 2 + 
(k + 3)6]. 

An illustration of the construction can be found in Figure [TJ Observe that the 
vertices a and ti t o € V\ are twins, i.e., they have the same neighbors in H. The 
vertex b is adjacent to every vertex in V\ U V2 U {a} U W. 



b 




a 



Fig. 1. Construction of the interval representation of the graph H in the proof of 
Lemma 15. II 

Our aim is to interpret the vertices of G in the set V{ = {ti i, . . . , ti n }. Since 
the vertices a and tip are the only twins in the constructed graph H, we can 
identify any one of them by the following formula: 

anchor(a;) = By (x ^ y A edge(x, y) A Vz 7^ x, y. edge(x, z) <h> edge(y, z)) . 

In what follows, we refer to these two vertices as anchors. 

Let dist(x, y) = c for an integer c stand for an FO formula expressing that the 
distance of x and y is c, and let adist(x) = c stand for an FO formula expressing 
that the distance of x from an anchor is c, i.e., 

adist(x) — c = 3y(anchor(y) A dist(x, y) = c) . 
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Using the just denned notation, the following formula is true for exactly the 
vertices of V{: 



/j,(x) = -i anchor(x) A adist(x) = 1 A By (adist(y) — 2 A -i edge(x, y)) . 

Note that the latter part makes n(x) false for x — b. So, H \= fi(t) iff t € V{ as 
needed. 

Notice that the set V3 contains exactly the vertices at distance three or four 
from the anchors. We say that a vertex x' = t 3 y e V3 is the mate of a vertex 
x = tij G V{ if j = j' . The following formula is true iff x' is the mate of x. 

mates(x, x 1 ) = fi(x) A (adist(x') = 3 A adist(x') = 4)A 

B\y ( adist(y) = 2 A -1 edge(a;, y) A -1 edge(x', y)) 

where 3! is a shortcut for "there exists unique". Note that if x = tij and 
x ' = i3,j'i then no y as above exists if j' < j, and there are several choices of y, 
in particular, t 2 .j, . ■ . , i2j', if j' > j- 

The main trick of our construction is the ability to "determine the position" 
of each of the intervals corresponding to e^fc with respect to the intervals cor- 
responding to the vertices in V\ and V3. To do so, we use the orderings on the 
intervals corresponding to the vertices of V\ and V3 given by their left end points. 
This ordering is reflected by their neighborhoods in V 2 - 

Let us make this more precise. A vertex x dominates a vertex y if y and all 
its neighbors are also neighbors of x. This property can be expressed using an 
FO formula as follows: 

domin(x, y) = x 7^ y A edge{x 1 y) A Vz (edge(y, z) — > edge(a;, z)) . 

Using this formula, we can write down the FO formula for v as follows: 



Note that v'(x, y) is true if and only if there is an interval that which intersects 
both the intervals corresponding to x £ V{ and y' £ V3, where y' is the mate of 
y, but not the intervals to the left from the interval corresponding for x and to 
the right from the interval corresponding for y' . If such an interval exists, then 
it corresponds to the vertex of W associated with the edge xy. So, the graph H 
is indeed an interpretation of G. □ 

Since many FO properties are W[2]-hard for general graphs, we can immedi- 
ately conclude the following. 

Corollary 5.2. If L is a subset of non-negative reals that is dense in some 
non-empty open set, then FO model checking is W[2]-hard on L-interval graphs. 



v(x,y) 
v'{x,y) 



v (x, y) V v (y, x) , where 

MO) A n{y) A x y A By', z[mates(y, y')A 
edge{x 1 z) A Vt (domin(x, £)—»-■ edge(t, z))A 
edge(y' , z) A Vt (domin(y' , t) — > -1 cdge(t, z)) ] . 
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We now turn our attention to interpretation in unit interval graphs. The 
price we pay for restricting to a smaller class of interval graphs is the strength 
of the interpretation language used, namely that of MSO logic. At this point we 
remark that there exist two commonly used MSO frameworks for graphs; the 
MSOi language which is allowed to quantify over vertices and vertex sets only, 
and MSO2 which is in addition allowed to quantify over edges and edge sets. We 
stay with the former weaker one in this paper. 

An MSOi graph interpretation is defined in the analogous way to former FO 
interpretation with the formulas [i and v being MSOi formulas. 

Lemma 5.3. There is a polynomially bounded simple MSOi interpretation of 
the class of all graphs into the class of unit interval graphs. 

Proof. Suppose that we are given an n-vertex G with n > 4 (the cases with 
n = 1, 2, 3 could be held separately). Our aim is to find a unit interval graph H 
and MSOi formulas fi and v such that (//, v) is an interpretation of G in H . Let 
vi,...,v n be the vertices of G and m the number of its edges. 

As in the proof of Lemma |5.1[ we will describe H by giving its interval 
representation and we again use the closed intervals in the representation. Let 
us introduce the following shorthand notation: if J is an interval and x a real, 
then J + x is the interval J shifted by x to the right. Similarly, if I is a set of 
intervals, then I + x is the set of the intervals from I shifted by x to the right. 

Choose 5 > such that Sn <sc 1 and U = { [iS, 1 + iS] : i = 0, 1, . . . ,n — l}. 
The graph H will contain n(3m+ 1) vertices corresponding to the intervals from 
the sets U + fc(l + 6) for k = 1, . . . , 3m + 1. For further reference, we say that 
the intervals [i5, 1 + id] and [i5, 1 + iS] + fc(l + S) are at the same level i. Let We, 
£ — 0, . . . , m, be the set of the n vertices corresponding to the intervals from 
U + {U + l){l + 5). 

Let A be the multiset containing three copies of the interval [0, 1]. The graph 
H further contains three vertices corresponding to the intervals in A and 3to 
vertices corresponding to the intervals in A + (3i — 1/2) (1 + S), i = 1, . . . , m. 
The vertices of H corresponding to these 3m + 3 intervals will be referred to as 
anchors and they will be the only vertices of H with two twins. 

Fix an ordering of the m edges of G. If the l-th edge joins vertices vj and 
v k , include an additional pair of vertices corresponding to the intervals [jS, 1 + 
jd] + {U+ 1)(1 + 8) and [kS, 1 + kS] + (3i+ 1)(1 + S) to H . The vertices included 
in this step are the only vertices of H that have unique twins. 

Finally, the graph H also contains a vertex corresponding to the interval 
[1/2,3/2]. Observe that the three vertices of H corresponding to the vertices of 
A are the only anchors in H with degree four. The whole construction of H is 
illustrated in Figure [5] 

We now give the MSOi formulas /1 and v such that (/x, v) is a simple MSOi 
interpretation of G in H. First, recall the FO property twin(x, y) from the proof 
of Lemma |5.1| which expresses that the vertices x and y are twins; 

twin(x, y) = (a; / j; A Vz / x, y. edge(x, z) O edge(y, z)) . 
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Fig. 2. An illustration of the interval representation Y (of the graph H) which shows 
the interpretation of an edge ei = VjVk (of the graph G). 

Using this formula, we can give formulas to identify anchors in H and the formula 
to identify vertices not adjacent to any anchors (these are the vertices in the sets 
W\,..., W m and the 2m twins of them included because of the edges of G): 

anchor(x) = By, z(z/yA edge(z, y) A twin(x, y) A twin(x, z)) , 
noanch(x) = Vz ( anchor(z) — > -i edge(a;, z)) . 

The vertices of H used to model the vertices of G are those in the set Wo- These 
are precisely the vertices not adjacent to any anchor with distance two from the 
three anchors of degree four. So, they can be identified using the following FO 
formula. 

fi(x) = noanch(x) A Bt( anchor(t) A deg(t) = 4 A dist(t, x) = 2) . 

We now introduce a notion of mates. Two vertices are mates if they belong 
to the sets W p -\ and W p for p = 1, . . . , 3m and the intervals corresponding to 
them are at the same level. Hence, two vertices x and x' are mates iff they satisfy 
the following formula. 

mates(x, x 1 ) = noanch(x) A noanch(x') A dist(x, x') = 4 A Bt 

[anchor(t) A B\y B\z{-^ edge(y, z) A edge(y,t) A edge(z,t)A 
dist(x, y) = 2 A dist(x' , y) > 2 A dist(x' , z) = 2 A dist(x, z) > 2)] . 

Let mates* (x,y) be the formula describing the transitive closure of the re- 
lation "being mates" (this is the only place in the proof where we need the 
expressive power of an MSOi formula). The edges of G can now be interpreted 
in H using the following formula. 

v{x,y) = n(x) A fj,(y) A x ± y A Bx lt x 2 , yi, yi ( edge(zi, y x )t\ 

mates* (x,xi) A mates* (y,yi) A twin(xi, x 2 ) A twin( y\, y 2 )) • 

Indeed, if x and y belong to Wo, then v{x,y) is true only if there exist vertices 
x\ and yi at the same level as x and y, respectively, that are adjacent (and 
therefore, in the same set W p ) and both x\ and j/i have twins. However, this 
happens only if xy should interpret an edge e p of G. □ 
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Again, we can immediately conclude the following. 

Corollary 5.4. MSOi model checking is W[2]-hard on unit interval graphs. 

This corollary is rather tight since the aforementioned result of Lozin [15] claims 
that every proper hereditary subclass of unit interval graphs has bounded clique- 
width, and hence MSOi model checking on this class is in linear time [3J. 

Lastly, we remark that Fellows et al have shown that testing FO properties 
on unit two- interval graphs (i.e., such that each vertex corresponds to a pair of 
intervals, each on a distinct line) is W[l]-hard. 
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